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Abstract 

Os 

Studying Nash dynamics is an important approach for analyzing the outcome of games with 
repeated selfish behavior of self-interested agents. Sink equilibria has been introduced by Goemans, 
Mirrokni, and Vctta for studying social cost on Nash dynamics over pure strategies in games. 
O However, they do not address the complexity of sink equilibria in these games. Recently, Fabrikant 

(D and Papadimitriou initiated the study of the complexity of Nash dynamics in two classes of games. 

In order to completely understand the complexity of Nash dynamics in a variety of games, we study 
^SJ the following three questions for various games: (i) given a state in game, can we verify if this state 

is in a sink equilibrium or not? (ii) given an instance of a game, can we verify if there exists any 

r , sink equilibrium other than pure Nash equilibria? and (iii) given an instance of a game, can we 

^1— verify if there exists a pure Nash equilibrium (i.e, a sink equilibrium with one state)? 

In this paper, we almost answer all of the above questions for a variety of classes of games 
^ with succinct representation, including anonymous games, player-specific and weighted congestion 

I ' games, valid-utility games, and two-sided market games. In particular, for most of these problems, 

we show that (i) it is PSPACE-complete to verify if a given state is in a sink equilibrium, (ii) it is 
NP-hard to verify if there exists a pure Nash equilibrium in the game or not, (iii) it is PSPACE- 
complete to verify if there exists any sink equilibrium other than pure Nash equilibria. To solve 
these problems, we illustrate general techniques that could be used to answer similar questions in 
ff^ other classes of games. 
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1 Introduction 



A standard approach in studying the outcome of a system involving self-interested behavior of agents 
is to investigate the Nash dynamics of the corresponding games. In Nash dynamics, agents repeatedly 
respond to the current state of the game by playing a best-response strategy. Studying such dynamics 
is very important for understanding the behavior of a system throughout time, and the outcome of 
the game after many repeated game play. Similar to the recent efforts in studying the complexity 
of game theoretic concepts such as mixed Nash equilibria [8l S] , and pure NE [TOl EI] , studying the 
complexity of Nash dynamics can help us better understand the outcome of a game. 

In an attempt to study such dynamics for pure strategies, Goemans, Mirrokni, and Vetta [T5] 
introduced the concept of sink equilibria in games: sink equilibria are strongly connected components 
of a strategy profile graph associated with the game with no outgoing edges. Equivalently, sink 
equilibria characterize all states for which the probability of reaching that state after a sufficiently large 
random best-response sequence is nonzero. Also any random best-response sequence will converge to 
a sink equilibrium with probability one. Moreover, sink equilibria generalize pure Nash equilibria in 
that a pure Nash equilibrium is a single-state sink equilibrium of the game. 

Goemans et al. |15i studied sink equilibria for their social cost in two classes of games. However, 
they did not consider the complexity of sink equilibria or Nash dynamics in those games. Recently, 
Fabrikant and Papadimitriou ^Uj initiated the study of the complexity of sink equilibria, by studying 
the problem of verifying if a state is in a sink equilibria for two classes of games. Extending on these 
ideas, we formalize several questions related to Nash dynamics of various games and completely study 
the complexity of the Nash dynamics and sink equilibria in these games. 

Sink equilibria characterize all strategy profiles in the game with a nonzero probability of reaching 
them after a long enough best-response walk. Therefore, given a strategy profile, in order to verify 
if there is a non-zero probability of reaching this state after a sufficiently long random best-response 
walk we need to verify if this state is in a sink equilibrium or not. This problem has been considered 
by Fabrikant and Papadimitriou [11] for two classes of games, and is as follows: 

In a Sink problem. Given an instance of a game and a strategy profile in this game, can we verify 
if this strategy profile belongs to any sink equilibria or not? 

For a given state in a game, an interesting problem is to estimate the probability of reaching this 
state after a long random best-response walk. Note that a hardness result for in a sink problem 
implies that for a given state, even approximating this probability is a computationally hard problem, 
(since distinguishing the probability of zero and nonzero is hard). Fabrikant and Papadimitriou 
showed that in a sink problem is PSPACE-hard for graphical games and a BGP next-hop routing 
game |llj . We show that this problem is PSPACE-complete for weighted/player-specific congestion 
games, valid- utility games, two-sided market games, and anonymous games. The proofs for all the 
above games except anonymous games are similar and based on a reduction from halting problem of a 
space bounded Turing machine. The proof for anonymous games has unique features and is different 
from the rest. 

Given an instance of a game, it is very helpful to know if the random repeated self-interested actions 
of the agents in the game can cycle forever or such dynamics will converge to a pure Nash equilibria 
with probability one. This problem is related to characterizing the structure of sink equilibria in a 
game, and in particular the existence of non-singleton sink equilibria. Having such a sink equilibrium 
indicates that even random Nash dynamics may also converge to an everlasting cycle. As a result, we 
formalize the following problem in games: 

Has a Non-singleton Sink problem. Given an instance of a game, can we verify if this game 
possesses a non-singleton sink equilibrium, i.e., sink equilibria other than pure Nash equilibria. 
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Pure Nash equilibria (if they exist) are local optima of the Nash dynamics. Other than the problem 
of computing a pure Nash equilibrium in various games, the problem of verifying if such equilibria 
exist has been studied for various classes of games. We complement the previous questions with the 
following problem: 

Has a Singleton Sink problem. Given an instance of a game, can we verify if this game possesses 
a pure Nash equilibrium (singleton sink equilibrium)? 

Answering all the above questions for a game gives a thorough understanding of the complexity 
of Nash dynamics and the complexity of characterizing sink equilibria in that game. 
Our Results. We study the above four problems in a variety of games with succinct representation 
including player-specific and weighted congestion games, anonymous games, valid-utility games, and 
two-sided market games. All of these games are well-studied for their existence of pure Nash equilibria, 
complexity of mixed and pure NE, or /and their price of anarchy for different social functions |il3l [T9l 
[T8l [TBI |6]. To solve these problems, we illustrate general techniques that could be used as tools to 
answer similar questions for other classes of games. 

Fabrikant and Papadimitriou showed that in a sink problem is PSPACE-hard for graphical games 
and a BGP next-hop routing game ^llj. They posed this problem as an open question for weighted 
congestion games, and valid-utility games. We show that this problem is PSPACE-complete for 
weighted/player-specific congestion games, valid- utility games, two-sided market games, and anony- 
mous games. The proofs for all the above games except anonymous games are similar and based 
on a reduction from halting problem of a space bounded Turing machine. The proof for anonymous 
games has unique features and is different from the rest. The hardness of the in a sink problem in 
anonymous games is despite the fact that approximate pure Nash equilibria can be computed in these 
games in polynomial time [?]. 

For Has a non-singleton sink problem, we prove that it is PSPACE-complete for weighted/player- 
specific congestion games, valid-utility games, two-sided market games, and anonymous games. The 
reductions for Has a non-singleton sink problem extend the proofs for the in a sink problem. 

Has a singleton sink problem has been well-studied for all games in this paper except for 
valid-utility games and two-sided market games. We show that HAS A singleton sink problem 
is NP-hard for these games as well. Our results for two-sided markets characterize the complexity 
of existence of a stable matching in many-to-one two-sided matching markets; an extensively studied 
problem in the economics literature |13| [20| [T7| . Existing results for many-to-one two-sided markets 
give sufficient conditions for existence of stable matchings (or pure Nash equilibria) in different variants 
of the problem [T31 EOl [T7] , but they have not explored the complexity of verifying the existence of 
stable matchings (or pure Nash equilibria) in these games. 

Related Work. Prior to this paper, the Has a non-singleton sink problem has not been studied 
for any of the above games. In a sink problem has been studied only for graphical games yjj. Has 
singleton Sink problem, however, has been studied extensively for all the above games except valid- 
utility games and two-sided market games. In fact, it has been shown that HAS A SINGLETON SINK 
problem is NP-hard for weighted congestion games and local-effect games [9 , player-specific congestion 
games pj , graphical games pjj , and action-graph games [lLGj . For anonymous games it has been shown 
that hat an approximate NE are computable in polynomial time[7] and that HAS A singleton sink 
is TC°-complete[3]. 

There has been a recent significant progress in understanding the complexity of equilibria in games. 
The complexity of mixed Nash equilibria is now well-understood by the recent results on PPAD-hard- 
ness of computing mixed NE[8l S] , and even for computing approximate mixed NEj5] . The complexity 
of pure Nash equilibria in various games (especially congestion games) have also been well-studied 
by recent results on PLS-completeness of computing a pure Nash equilibrium [TOl H], and even for 
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computing an approximate pure NE [21]. 

2 Preliminaries 

2.1 General Definitions 

Strategic games. A strategic game (or a normal-form game) A =< N, (Sj), (ui) > has a finite set 
A'^ = {1, . . . , n} of players. Player i £ N has a set Sj of strategies (or strategies). The whole strategy 
set is S = Si X • • • X S„ and a strategy profile S £ T, is also called a profile or state. The utility function 
of player i is Uj : S ^ M, which maps the joint strategy S" G S to a real number. Let S = (si, . . . , s„) 
denote the profile of strategies taken by the players, and let s_j = (si, . . . , Si-i, Sj+i, . . . , s„) denote the 
profile of strategies taken by all players other than player i. Note that S = (sj, s_j). An improvement 
move for a player i in a profile S" is a move for which iij(s„,,s9 > Ui{S). A best response move 
5" for a player i in a profile S is an improvement move that has the maximum utility. Note that in 
cost minimizing games, each player i wants to minimize the cost Ci{S) = —Ui{S) in strategy profile 
S. This type of games include congestion games with delay functions on edges which will be defined 
later. 

Nash equilibria (NE): A strategy profile 5 G E is a pure Nash equilibrium if no player i £ N can 
benefit from unilaterally deviating from his strategy to another strategy, i.e., yi £ N Vs^ £ Sj : 
Ui{s-i,s'^ < Ui{S). We can also define a-Nash equilibria as follows. For 1 > a > 0, a state S is an 
a-Nash equilibrium if for every player i, Ci{s-i, s'J > (1 — a)ci{S) for all s[ £ Sj. 
State graph. Given any game A, the state graph G{A) is an arc-labeled directed graph as follows. 
Each vertex in the graph represents a joint strategy S. There is an arc from state S to state S' with 
label i iff there exists player i and strategy s[ £ Sj such that S' = (s__j,s9, i.e., S' is obtained from 
S by a move of a single player i that improves his utility from S to S' . 

Nash dynamics. A Nash dynamics or best-response dynamics is equivalent to a walk in the state 
graph. 

Sink equilibria. Given any game A, a sink equilibrium is a subset of states T that form a strongly 
connected component of the state graph such that there is no outgoing edge from states in T to any 
state outside T. As a result, any pure Nash equilibrium of a game is a single-state sink equilibrium, 
and a game may have several sink equilibria. 

2.2 Definition of games 

(Unweighted) Congestion Games. An (unweighted) congestion game is defined by a tuple < 
N, E, (Sj)jg7V) {de)e€E > where is a set of resources, Sj C 2^ is the strategy space of player i, and 
de : N — > Z is a delay function associated with resource e. For a strategy profile S = (si, . . . ,Sn), 
we define the congestion ne{S) on resource e by ne{S) = \{i\e £ Si}\, that is ne{S) is the number of 
players that selected an strategy containing resource e in S. The cost (or delay) Ci{S) of player i in a 
strategy profile S is Cj(5) = -Ui{S) = Xleesi de{ne{S)). 

In weighted congestion games, player i has weighted demand Wi. In this game, the congestion 
(load) on resource e in a state S, denoted by by le{S) is as follows le{S) = Yli\eesi '^^^ ^^^^ 
delay of players is defined the same way as the congestion games. A player-specific congestion game is 
defined by a tuple < A^, E, (Sj)jg7V) {de,i)e€E,i£N > where E and Sj C 2^ are the same as congestion 
games, and de,i : N ^ Z is a delay function associated with resource e and player i. The congestion 
ree(S') on resource e is defined the same as congestion games. The cost (delay) Cj(S') of player z in a 
strategy profile 5 is Cj(S') = -Ui{S) = X^eGsi de,i{ne{S)). 
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Many-to-one Two-sided Markets. We model the many-to-one two-sided market {X ,y) between 
two sides of active agents X and passive agents 3^ as a game G{X,y) among active agents x ^ X. 
The strategy set of each active agent x & X is s, lower-ideal family of subsets of passive agents J-x 
where J-x ^ 2-^, i.e., an active agent x (z X can play a subset Sx G J'x of passive agents. Each agent 
X ^ X also has a preference (a.k.a social choice) over its strategies. This preference is capture by a 
utility function : 2-^ — > M which assigns a utility, Ux{T), to each subset T C 3^. Each agent y y 
has a strict preference list over the set of agents x £ X that can play this set, i. e., x is preferred to x' 
by y iff Uy{x) > Uy{x'). We assume that Uy{x) / Uy{x') for any two agents x and x'. Given a vector 
of strategies S = (si, . . . ,Sn) for active agents, agent y is matched to the best agent x £ X in the 
preference list of agent y such that y G Sx- In this case, we say that x is the winner of agent y, or 
equivalently, agent x wins agent y. The goal of each active agent x is to maximize the utility of the 
set of passive agents that she wins. Given a strategy profile S, let Tx{S) C Sx be the set of passive 
agents that agent x wins. The utility of player x in strategy profile S is equal to Ux{Tx{S)), the goal 
of X is to maximize this utility. 

It is not see that pure Nash equilibria of the above game correspond to stable matchings for 
many-to-one two-sided markets as defined by ... 

Valid-utility Games. Here we briefly define the class of valid-utility games; see [22] for more details. 
In valid-utility games, for each player i, there exists a ground set of markets Vi. We denote by V the 
union of ground sets of all players, i.e., V = Uigj/Vi. The feasible strategy set Fi of player i is a subset 
of the power set, 2^% of Vi. Thus, a strategy Si of player i is a subset of Vi {si C V)- The empty set, 
denoted 0j for player i, corresponds to player i taking no action. 

Let G{U, {Fi\i G U}, {ui{)\i G U}) be a non-cooperative strategic game where Fi C is a family of 
feasible strategies for player i. Let V = Lli^ijVi and let the social function be 7 : Ili^u2^ —>■ M"*" U {0}. 
Then ^ is a valid-utility game if it satisfies the following properties: (1) The social function 7 is 
submodular and non-decreasing, (2) The utility of a player is at least the difference in the social 
function when the player participates versus when it does not participate, and (3) For any strategy 
profile, the sum of the utilities of players should be less than or equal to the social function for that 
strategy profile. 

This framework encompasses a wide range of games including the facility location games, traffic 
routing games, auctions [22], market sharing games [TJ], and distributed caching games [12]. In [22] 
it was shown that the price of anarchy (for mixed Nash equilibria) in valid-utility games is at most 2. 
Anonymous games. Anonymous game[6j are games in which players have the same strategy sets, 
but different utilities for the same strategies; however, these utilities do not depend on the identity of 
the other players, but only on the number other players taking each action. An interesting subclass 
of these games is anonymous games with a constant-size strategy set in which the size of the strategy 
set of players is a fixed constant. 

3 Existence of Pure Nash Equilibria 

In this section, we study the Has a Pure problem for succinct games. This problem has been already 
considered and resolved for weighted congestion games [] and player-specific congestion games [] . We 
resolve this problem for many-to-one two-sided markets and valid-utility games. The result for two- 
sided markets imply that given an instance of the many-to-one stable matching problem, verifying if 
there exists a stable matching is NP-hard. 



A family of subsets is lower-ideal if and only if for any subset S £ T and S' £ S, then 5*' £ T. 
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Theorem 1. Has a singleton Sink is HP-hard for (i) uniform utility-based two-sided market games, 
(a) many-to-one two-sided market games, and (Hi) valid-utility games. 

Proof To prove NP-completeness, we give a reduction from the 3Sat problem. Given an instance of 
the 3Sat problem, we construct an instance of the utility-based two-sided market game as follows: 
for each variable Xi , we put a player Xi with a one and a zero strategy. For each clause cj , we put two 
players Cj and Kj each with a one and a zero strategy. We construct the game such that Cj and Kj 
have a cycle of best responses if and only if the clause is not satisfied. In other words, if the X-players 
choose a strategy profile that satisfies all clauses, all clause players eventually reach a stable solution. 

The zero strategy of Cj is {aj,bj} and the one strategy is {cj}. The zero strategy of Kj is {aj} 
and the one strategy is {bj} U {rj^j|for all variablesxj in clause cj}. The a-markets have utility 305 
and prefer the players. The 6-markets have utility 8 and prefer the C-players. The c-markets 
have utility 310. The r-markets have utility 100 and prefer the X-players. Note that there is a best 
response cycle of Cj and Kj if and only if none of the three rj^j-markets is allocated by an X- player. 

The zero strategy of a player Xi is {rij\xi £ cj} U {pij\xi G cj}. The one strategy of a player Xi 
is {rij\xi S Cj} U {pij\xi E cj}. The p-markets have utility 100. Note that both strategies have the 
same utility for a X-player independent of the strategy profile of other players. Furthermore, Xi gets 
the utility from rij, if and only if it satisfies clause Cj, □ 

The above theorem implies that given an instance of the many-to-one stable matching problem, 
the problem of verifying if this game has a stable matching or not is NP-hard. Known results in the 
economic literature for many-to-one two-sided markets discuss necessary and sufficient conditions for 
existence of stable matchings (or pure Nash equilibria) for different variants of two-sided markets \13\ 
[20] [T7] . however, before our results, the known results have not addressed the complexity of verifying 
the existence of stable matchings (or pure Nash equilibria) given an instance of these markets. 

4 Sink Equilibria and Weighted Congestion Games 

In this section, we study the complexity of the In a Sink and Has a Sink problem for weighted 
congestion games. The interesting aspect of this proof is that we can use similar reductions for a 
variety of games with succinct representation. Applying this proof on many examples shows the 
strength of the proof technique. 

Theorem 2. In a Sink is PSPACE-hard for weighted congestion games. 

Proof. We give a reduction from the space-bounded halting problem for Turing machines. First, we 
reduce an instance of this problem (a TM M, an input x and a tape bound t) to the halting problem 
for a TM M' = {Q, E, b, F, 6, qo, {qh}) which simulates M on x without its own input. Let S = {0, 1} 
and F = {0, 1,6}. Starting from an empty tape, M' halts if and only if M rejects x . Furthermore, M' 
uses additional tape cells and states for a counter that counts up to the total number of configurations 
of M. When M accepts, the counter overflows, or M exceeds the tape bound t, M' erases the whole 
tape, moves the head to the initial position and returns to state go- uses tape cells only right of its 
initial position and at most t' tape cells. Note that starting from every total configuration M' never 
stops only if M rejects x. 

To complete the proof, we construct a congestion game Gm' that simulates Turing machine M' . 
A strategy profile s which we define later is in a sink equilibrium if and only if M' runs forever. The 
game consists of three types of configuration players, a transition player, a set of control players, and 
a clock player. The first type of configuration players is a state player with \Q\ strategies. The second 
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type of configuration players is a position player for the position of the head with t' strategies; and 
the third type of configuration players is a set of cell players cellj for each tape cell < i < t' with 
the |r| strategies for the content of the tape cell i. There is a simple bijective mapping between the 
strategy profiles of the configuration players and the configurations of M' . 

The game is constructed in such a way that every sequence of improvement steps can be divided 
in rounds. At the end of a round i, let Cj be the configuration obtained from the strategy profile of 
the configuration players. For every sequence of improvement steps, ci h C2 1~ C3 h . . . denotes the run 
of M' starting from ci. 

We now describe our construction in more details. The strategies of the configuration players are 
described in Figure [TJ Every strategy of a configuration player has two unique resources, an a resource 
and a P resource. The a resources have delay if allocated by one player and delay 1 otherwise. The 
(3 resources have delay if allocated by one player and delay M otherwise. 



state player 



strategies 


resources 


delays 


qeQ 




0/1 
0/M 



position player 



strategies 


resources 


delays 


0<i<t' 


a' 


0/1 
0/M 



player cellj with < i <t' 



strategies 


resources 


delays 


a G r 


/5r 


0/1 
0/M 



Figure 1: Definition of strategies of the three types of configuration players 



Player Control 



W,q,i,i' ,cr 



Strategy 


Resources 


Delays 


Zero 


Pw,g,i,i',(7 
^W,q,i,i',a 


0/M 
0/1 


One 


Pw,q,i,i',(7 
'^W,q,i,i',a 


0/M 
0/1 



Player Control 



V,q,i,i' ,cr 



Strategy 


Resources 


Delays 


Zero 


Pv,q,i,i',CT 
^V,q,i,i',a 


0/M 
0/1 


One 


Pv,q,i,i',a 
^V,q,i,i',a 


0/M 
0/1 



Control £) 



Strategy 


Resources 


Delays 


Zero 


PI 


0/M 
0/1 


One 


P'd 


0/M 
0/1 



Figure 2: Strategies of the control players, for each g € Q, < i < n, i' G {i — 1, 1, i + 1}, and cr G F 



Each control player has two strategies. Zero and One, which are constructed in the same manner 
like strategies of configuration players (see Figure [2]). The transition player has the following strategies 
Wait, Done, Halt, and several strategies Readg^j^o-, Writeg/^j/^j^o-'j and Verifyg/^j/^j^o-/ (for each G 
{1, . . . q^q' G Q, and a, a' G S). The details of theses strategies and the resources they contain 
are listed in Figure |3| The clock player has two strategies. Trigger and Wait. Trigger contains the 
two resources, TriggerMain and TriggerClock. The strategy Wait contains one resource with constant 
delay of 110. 

Let us remark that each a- or /3-resource is allocated by at most two players; the transition player 
and one of the configuration or control players. The general idea is that the improvement steps for 
the transition player is determined by the strategy profile of the configuration and control players. 
That is, the transition player never deviates to a strategy that contains a /^-resource which is allocated 
by another player. On the other hand, the transition player determines the improvement steps for 
configuration and control players if he allocates a-resources. Note that each a-resource is associated 
with exactly one strategy of exactly one configuration or control player. 

Now, we are ready to describe the aforementioned sequence of improvement steps that corresponds 
to one round in more details. Consider any strategy profile in which the clock players are on Trigger, 
the transition player is on Wait and all control players except control/) are on One. Let q be the 
strategy of the state player, i the strategy of the position player and do, • • . , (Tf the strategies played 
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Strategy 


Resources 


Delays 


Wait 


Pw,q',i',i,^'^(^vl'^ ^' ^' foi' all i', cr' 
TriggerMain 


0/M 
0/1 

0/100/100 




/?P for all peQ\q 


0/M 


for each q £ Q, 


for all J 7^ i 


0/M 


< i < t' and cr e r 


Pf for all fj' G r \ cj 


0/M 




"H/,g',j',j,<7' ■^itl^ ^(9' ^) = (^'' a^"^ i' = i + d 


0/M 
0/1 




N.N.' 


80 




for all p€Q'\q' 


0/1 


for each q' £ Q, < i < t' , 


for all j 7^ 


0/1 


i' G {i - + 1}, 


af for all <T e r \ a' 


0/1 


and cj' G r 


f^W,q' ,i' ,i,cT' 


0/1 
0/M 




N.N. 


60 


Verifyq'^i'^iy 


/?P for all G g \ 


0/M 


for each q' G Q, < i < t' , 


for all j / i' 


0/M 


i' e + 1}, 


/3f for all fj G r \ fj' 


0/M 


and a' gT 


N.N. 


0/M 

0/1 

40 


Done 


triggerClock 


0/0/20 




"Wy,i,<x" ai/,g',i',i,<7' for ah g',i',i,cT' 
N.N. 


0/M 

0/1 

20 


Halt 


for all qeQ\qh 


0/M 



Figure 3: Definition of strategies of the transition player. Resources that are denoted by N.N. are 
used by the transition player only and have a constant delay. 

by the players ccUq, . . . , cellf. Figure |4] describes the sequence of improvement steps emerging from 
this strategy profile. The strategy profile at the end of the round differs from the initial one only in 
the choices of the configuration players. The deviations of the configuration players corresponds to a 
step of the Turing machine M' . Note that this sequence is essentially unique as there are no other 
improving deviations. If and only if the state player is on q^, the transition player may deviate to the 
strategy Halt. This is a Nash equilibrium of Gm'- Now let s be a strategy profile in which the clock 
players is on Trigger, the transition player on Wait, and all control players except controls on One. 
Let the configuration players' choice in s correspond to the initial configuration of M'. Then, s is in 
a sink equilibrium if and only if M' does not halt. 

□ 

We now consider the problem Has a non-singleton Sink for weighted congestion games. 

Theorem 3. Has a non-singleton Sink is PSPACE-hard for weighted congestion games. 

This results follows from the proof of Theorem |2] and the following Lemma. The lemma implies 
that there is at most one unique sink equilibrium in the constructed game. 
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(1) The transition player deviates from Wait to Readg^ j^o-^ . 

(2) Player controliy,g',j',j,(T' deviates to Zero. 

(3) The transition player deviates to Writeg'^j'^j^o-'- 

(4) The configuration players deviate to the new configuration 
and the player control v,g',i',i,o-' deviates to Zero. 

(5) The transition player deviates to Verifyg/^j/^j^o-'- 

(6) The player control^ deviates to One. 

(7) The transition player deviates to Done. 

(8) The clock player deviates to Wait and 

the controU players except control/^ deviate to Zero 

(9) The transition player deviates to Wait. 

(10) The clock player deviates to Trigger and 
the player controli) deviates to Zero 



Figure 4: Description of a round. 

Lemma 4. Every Sink equilibrium contains a strategy profile in which the clock player is on Trigger, 
the main player on Wait and all controli players on their Zero strategy. 

Proof. If no player has delay M or greater, the game converges as described in Figure |4] and eventually 
reaches a strategy profile in which the clock player is on Trigger, the main player on Wait and all 
controli players on their Zero strategy. Note that no strategy profile with a player having delay M or 
greater is reachable. If players have delay of M or greater, there is a sequence of improvement steps 
such that no player has delay of M or more, e.g. each control or configuration player with delay of 
M changes to another strategy. □ 

Thus, every sink equilibrium also contains the strategy profile that corresponds to the initial 
configuration of M' . Therefore, there is a unique sink equilibrium if and only if M rejects x. 

5 Sink Equilibria and Player- Specific Congestion Games 

Theorem 5. In a Sink is PSP/KCE-hard for player- specific congestion games. 

One can easily replace the clock player in the construction which is the only player with non- 
uniform weight by a player with weight 1 and modify the (player-specific) delay functions as follows. 
For the transition player the resource TriggerMain has delay if one player allocates it and delay 100 
otherwise. For the clock player the resource TriggerMain has always delay 100. The delay functions 
of the resource TriggerClock is identical for both players. It has delay if one player allocates the 
resource and delay 20 for two or more players. For each strategy profile the delay for each player is 
identical to the delay in the previous example. 

Theorem 6. Has a non-singleton Sink is PSPACE-hard for player-specific congestion games. 
Proof. This result follows by the same argument as for Theorem [3| □ 
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6 Sink Equilibria and Anonymous Games 



Next, we consider anonymous games with constant-size strategy set and show that IN A SINK for this 
game is also PSPACE-complete. 

Theorem 7. In a Sink is PSPACE-Ziarc? for anonymous games with constant- size strategy sets. 

We give a reduction from the halting problem of a space bounded Turing machine M' as defined 
in the proof of Theorem [2j Additionally, we assume that states of M' are denoted hy q^, ... ,qm where 
qm is the halting state. We construct an anonymous game with a constant number of strategies. Each 
player has a set of {allowed) strategies. Every strategy that is not allowed always has utility 0. The 
only other utility values in the game are 1 and 2. Given a strategy profile s = (si, . . . ,5^), let \si\ 
denotes the number of players that play strategy Sj. 

The game consists of the three types of configuration players and five types of auxiliary players and 
two control players. The strategy choices of the configuration players can be mapped to configurations 
of the TM M' . Every sequence of improvement steps can be partitioned into rounds. Each round 
simulates one step of M' . At the end of a round i, let Cj be the configuration obtained from the strategy 
profile of the configuration players. For every sequence of improvement steps, 01^02^03!-... equals 
the run of M' starting from ci. 

We first describe the configuration players before we describe the remaining players and the process 
that simulates one step of M' . The first type of configuration players are \Q\ identical state players 
that choose between the two actions state^ and state^. For j = |state^| corresponds to M' being 
in state qj. The second type are t' identical position players that choose between the two actions 
position^ and position''. For p = |position^| corresponds to the head of M' being in position p. The 
third type are the cell players cello, . . . jCell^/ which choose between the actions cell'', cell^, cell'', and 
change. Unlike the previous two types of players, the cell players are non-identical, i.e., each player has 
a different payoff function. For each 1 < z < t', player cellj on action cell" (cell^ or cell'') corresponds 
to the fact that tape cell i contains the symbol (1 or blank). 



Players 


allowed strategies 


celli , . . . , cellt' 


cell", cell cell'', change 


positioni , . . . , position^/ 


position^, position" 


statei ? • • • ; state^T^ 


state^, state" 


tapei, . . . , tape^/ 


tape", tape""^, tape'' 


symbol 


symbol", symbol^ , symbol 


new-sym 


new-sym", new-sym^, new-sym'' 


new-posi, . . . , new-pos^/ 


new-pos"*^, new-pos" 


new-statei, . . . , new-statem 


new-state"*^, new-state" 


transition! 


init, tape-change, eval-tape, new-sym, new-sym2, new-pos, 
new-pos2, new-state, new-state2, halt 


transition2 


Xinit, Xtape-change, Xeval-tape, Xfnew-sym, Xnew-sym2, 
Xnew-pos Xnew-pos2, Xnew-state, Xnew-state2 



Figure 5: Players and their strategies 

There are five types of auxiliary players and two control players. All players and their allowed 
strategies are listed in Figure [5] The utility functions for each player are described in Appendix |B| 
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The players tapei, . . . , tapef have identical payoff functions. They are used to evaluate symbol at the 
current position. The player symbol saves this symbol. The players new-sym, new-posi, . . . , new-pos^/, 
new-statei, . . . , new-state^ calculate the changes to the configuration. The control players ensure that 
strategy changes happen in a certain order that corresponds to one step. 

Lemma 8. Let c be a configuration of M' and d the successor configuration. Every sequence of 
improvement steps from a strategy profile in which the configuration players play corresponding to c 
and the first control player is on init, reaches a strategy profile in which the configuration players play 
corresponding to c' and the first control player is on init. 

Proof. We now describe this sequence of improvement steps which we call a round. It is listed in 
Figure [T] in detail. One can easily check for each of the strategy profiles that the next one is essentially 
unique. 

In a round, the first control player successively changes through his strategies (c.f. steps (2), (4),...). 
The second control player follows his choices in his corresponding strategies. By construction of the 
payoff function, this ensures that the control players only change their strategies in a certain order. 
Each of these steps of the first control player is interrupted by improvement steps of subsets of 
configuration or auxiliary players. The utility functions (cf. Figure [?]) are designed in such a way that 
these improvement steps are possible if and only if the control player plays the corresponding strategy. 
Additionally, the control player may only continue with his next step after these other player have 
changed their strategies (cf. Figure [8| . 

We now describe the improvement steps of the configuration and auxiliary players only. Consider 
any strategy profile of the configuration players and assume the first control player is on init (strategy 
profile (1) in Figure [T] in Appendix [b} The t' tape players change to a strategy profile in that the 
number of players on tape'', tape^, and tape'' equals the number of players on celP, cell^, and cell'' (2). 
The player cellj with i = | position*! changes to his strategy to change (4). The symbol player changes 
to symbol^, symbol^, or symbol'' depending on which strategy was left by the player celh (6). This 
can be coded into the utility function by evaluating the difference of number of players in the cell and 
tape strategies. The player new-symbol changes to the strategy new-symbol*^ where a' corresponds to 
the new symbol (8). This can be coded as a function as from number of players on symbol'^, symbol^, 
symbol'', and state^. The player cell, changes to the strategy cell*^ (10). Exactly i' players choose 
new-pos^ where i' is the new position of M' (12). The players position change their strategies such 
that [positional = |new-pos^| = i' (14). Exactly q' players new-state choose new-state^ where qqi is the 
new state of M' (16). The players state change their strategies such that Istate""^! = |new-state"^| = q' 
(18). The configuration players' strategy profile now corresponds to the new configuration after one 
step of M'. □ 

Theorem 9. Has a non-singleton Sink is PSPACE-hard for anonymous games. 

Proof. By construction of M' and the proof of Theorem [7] it suffices to show that every infinite 
sequence of improvement steps contains a strategy profile with player control! on init, i.e. a profile 
listed in the first row of Table [T] 

The strategy changes of control! have to occur in the same order as listed in Table [T} Therefore, 
every sequence with infinite strategy changes of control! contains a profile with control! on init. We, 
therefore, show that there is no infinite sequence that contains no strategy change of control!. Thus, 
fix any strategy choice for player control! . Observe that the utility functions of the remaining players 
(cf. Figure W\ do not allow an infinite sequence. □ 
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7 Sink Equilibria in other Games 



Theorem 10. In a Sink is PSPACE-hard for (i) uniform utility-based two-sided market games, (ii) 
many-to-one two-sided market games, and (Hi) valid-utility games. 

Theorem 11. Has a non-singleton Sink is PSPACE-hard for (i) uniform utility-based two-sided 
market games, (ii) many-to-one two-sided market games, and (Hi) valid-utility games. 

The proof is a rework of the proof for Theorem [2] and is shifted to Appendix [A] The Nash dynamics 
of the uniform utihty-based two-sided market game that we describe there is isomorphic to the Nash 
dynamics of the congestion game in the proof for Theorem [2] 
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A Proof of Theorem 



10 



The Nash dynamics of the uniform utihty-based two-sided market game that we describe here is 
isomorphic to the Nash dynamics of the congestion game in the proof for Theorem [2| Thus, ah 
properties easily transfer. The strategies of the transition player and the preferences of the markets 
can be found in Figure [6} The strategies of the remaining players can be obtained from the previous 
proof. 



Strategy 



Markets 



Utilities (Preference) 



Wait 



Read,, 



Pw,q' ,i' I'^^'^h^y' 

Pyq' , i' , i, a' for all q' , i' , i, a' 



a 



for each q £ Q, 

<i<t' and a eT 



D 

TriggerMain 

PP for all p£Q\q 

for all j ^ i 
Pf for all a' E r \ £7 





W,q',i',i,a' 



a 

and i' 
N.N. 



, j^, with 6{q,a) = {q',a',d) 



i + d 



M (Controlvy^g'^j'^j^o-') transition player) 

M (Controly^g'^j'^j^o-'j transition player) 

1 (transition player, Control^) 

100 (clock player, transition player) 

M (state player, transition player) 

M (position player, transition player) 

M (cellj, transition player) 

M (Control/3, transition player) 

1 (transition player, Controlvi/,(j',i',i,CT') 

N - {\Q\ + t' + \r\ -l)M + 20 



for each q' £ Q, < i < t' , 
i' e + 1}, 

and a' e r 



aP for all p€Q'\q' 

for all j ^ i' 
a1 for all 0- E r \ cr' 



N.N. 



1 (transition player, state player) 
1 (transition player, position player) 
1 (transition player, cellj 
1 (transition player, Controly^g'^j'^j^o-') 
M (Controli4/,5',i',i,o-')transition player) 
- M + 40 



for each q' £ Q, < i < t' , 
i' £ {i - + 1}, 
and cj' G r 



PP for all p£Q\q' 

pi for all j / i' 

Pf for all CT G r \ ct' 



a 



D 

N.N. 



M (state player, transition player) 

M (position player, transition player) 

M (cellj, transition player) 

M (Controly^g'^j'^j^o-', transition player) 

1 (transition player. Control^) 

N - {\Q\ + t' + \T\ -l)M + 60 



Done 



triggerClock 

^W,q' ,i' ,i,a' fol' all 

"v,q',i',iy foi' all q'J',h(^' 
N.N.' 



80 (transition player, clock player) 
M(Control£), transition player) 
1 (transition player, Controlvi/,(j',j',j,(T' 
1 (transition player, Controly^g'^j'^j^o-') 
N - M + 20 



Halt 



pi for all qeQ\qh 
N.N. 



M (state player, transition player) 
N - M 



Figure 6: Strategies of the transition players. Markets denoted by N.N. are used by the transition 
players only. Let N = \Q\{t + l)6|r|M 
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B Details of the proof of Theorem [7] 



Player 


strategy 


partitions with utility 2 


cellj 


change 


|tape-change| 7^ and position^ = i 




cell° 


new-tape 7^ and new-sym'' > 




cehi 


new-tape 7^ and new-sym^ > 




celr 


new-tape 7^ and new-sym^ > 


tapej 


n 

tape 


l""jl / Tl TlOl \ i fx \ 

mit 7^ and celr > tape'' 




1 

tape 


init 7^ and cell^ > tape"' 




h 

tape 


init 7^ and cell'' > tape'' 


symbol 


symbol 


eval-tape 7^ and cell" - tape" < 




1 1 1 
symbol 

1 

symbol 


eval-tape 7^ and cell^ - tape"' < 
eval-tape 7^ and cell'' - tape'' < 


new-sym 


new-sym" 


new-symbol 7^ and if is new symbol 




1 

new-sym 


new-symbol 7^ and if 1 is new symbol 




h 

new-sym 


new-symbol 7^ L) and 11 is new symbol 


new-pos 


1 

new-pos 


new-pos 7^ and new-pos'^ < new position 




n 

new-pos 


new-pos 7^ and new-pos'^ > new position 


new-state 


new-state 


new-state 7^ (J and new-state > new state 




new-state^ 


new-state 7^ and |new-state"' < new state 


position 


position^ 


new-pos2 7^ and position-' < new-pos-' 




position'' 


new-pos2 7^ and position^ > new-pos-' 


state 


state^ 


new-state2 7^ and state-' < new-state^ 




state'^ 


new-state2| 7^ and state-' > new-state^ 


halt 


state"^ = m 





Figure 7: The strategy partition combinations are listed that induce utility 2. Note that the new 
symbol, new position, and new state can be coded as a function of | symbol*^ |,| symbol-' |,| symbol''!, and 
Istate-'l. 
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JJcil LI LlUIio WiLli ULlilLy Z 




LA-llllL ^ U dliU. Ceil — Ld-pt: dllU. CCii — Ldpc 

dlUJ. l^Cll UdpC 


f^TTfl 1— "h CI T~\fi 
C V d± L CXiVJ C 


rX "h PI T^fi— r* n PI n cTfi (1 pmH u^pII— f npmcrp — 1 

J\. LdpC dldil^C ^ yJ dilvl v_/dl V/lldll^C — i. 


lie W ij\ 111 


I^PA/'pl-fpTipl n PTirl IppH^I -I- 1 ti'\/'TnKnl'^ 1 — ll'j^'np^l anrl IppII"'"! A- l<5vmHnl"'"l — II'j^t^P"'"! 
-/i-Cvcxi tcxpc- ^ \j ciiiivj. v_/di \^ oj'iiiu'ji — uctpc cxiiv_i i^cii 1^ i-LLuyji. — uctpc 

and |cell*| + |symbol''| = [tape''! 


new-sym2 


|Xnew-sym| > and |new-sym'^'| = 1 for a' = new symbol 


new-pos 


|Xnew-sym2| > and change| = 


new-pos2 


|Xnew-pos > and new-pos^ = new position 


new-state 


|Xnew-pos2| > and d |position^| = |new-pos^| 


new-state2 


|Xnew-state| > and |new-state-^ | = new state 


init 


|Xnew-state2| > and Istate-"^! = |new-state^| 


stop 


[positional = m 



Figure 8: The strategy/partition combinations of the first control player are listed that induce utility 
of 2. 



strategy 


partitions with utility 2 


Xinit 


init > 


Xtape-change 


1 tape-change 1 > 


Xeval-tape 


|eval-tape| > 


Xnew-sym 


new-sym| > 


Xnew-sym2 


new-sym2 > 


Xnew-pos 


1 new-pos > 


Xnew-pos2 


|new-pos2| > 


Xnew-state 


1 new-state 1 > 


Xnew-state2 


|new-state2| > 



Figure 9: The strategy/partition combinations of the second control player that induce utility of 2. 
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